Spoken-style explanation generator for Japanese kanji using a text-to-speech system
نویسندگان
چکیده
In this paper we describe a spoken explanation generator, PLANET, for Japanese Kanji (ideograms), especially Kanji used in people's names. A number of text-to-speech systems for Kanji texts have been proposed but this is the rst one that can explain Kanji characters so as to disambiguate characters from many homophone Kanji candidates. To accomplish this the generator explains the Kanji by using both internal composition and use in other words. The system has a database of over 6,000 Kanji characters that breaks down any given Kanji into its components and a text corpus of explanations for explaining two or more Kanji characters. It is capable of generating both other words that include the Kanji characters in question, and identifying information. Using these other words and the information the system makes phrases and sentences. Furthermore, this system generates natural prosodic information by classifying the pattern of semantic connections between words and phrases. The explanations are output through a natural-sounding voice synthesizer. Hearing examinations con rmed that this system achieves high accuracy in disambiguating Kanji characters from among many candidates. This system will make it possible to provide advanced and user-friendly human-computer interfaces.
منابع مشابه
Sentence-Style Conversion of Japanese News Article for Text-to-Speech Application
This paper proposes a method of sentence-style conversion for generating spontaneous Japanese speech in a text-to-speech synthesis system. Since written language is different from spoken language linguistically, the speech by direct reading of written Japanese texts might be unnatural. The method takes a fully rulebased approach to convert the sentence style and to complement sentences, which a...
متن کاملParallel Speech Corpora of Japanese Dialects
Clean speech data is necessary for spoken language processing, however, there is no public Japanese dialect corpus collected for speech processing. Parallel speech corpora of dialect are also important because real dialect affects each other, however, the existing data only includes noisy speech data of dialects and their translation in common language. In this paper, we collected parallel spee...
متن کاملAdding Japanese language synthesis support to the eSpeak system
In this paper we describe an addition to the eSpeak system which is capable of pronouncing Japanese language. This implementation is used for automatic segmentation of Japanese speech from Japanese text. The speech synthesiser that we use is part of the praat speech analyse program and is based on the eSpeak text-to-speech engine. Because the Japanese writing system is very complex i.e. it mixe...
متن کاملSpoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting
Islamic Republic of Iran Broadcasting (IRIB) as one of the biggest broadcasting organizations, produces thousands of hours of media content daily. Accordingly, the IRIBchr('39')s archive is one of the richest archives in Iran containing a huge amount of multimedia data. Monitoring this massive volume of data, and brows and retrieval of this archive is one of the key issues for this broadcasting...
متن کاملRecent Topics in Speech Recognition Research at NTT Laboratories
This paper introduces three recent topics in speech recognition research at NTT (Nippon Telegraph and Telephone) Human Interface Laboratories. The first topic is a new HMM (hidden Markov model) technique that uses VQ-code bigrams to constrain the output probability distribution of the model according to the VQ-codes of previons frames. The output probability distribution changes depending on th...
متن کامل